Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 78
Filtrar
1.
J Clin Oncol ; 42(9): 1001-1010, 2024 Mar 20.
Artículo en Inglés | MEDLINE | ID: mdl-38320222

RESUMEN

PURPOSE: This study assessed the prevalence of specific major adverse financial events (AFEs)-bankruptcies, liens, and evictions-before a cancer diagnosis and their association with later-stage cancer at diagnosis. METHODS: Patients age 20-69 years diagnosed with cancer during 2014-2015 were identified from the Seattle, Louisiana, and Georgia SEER population-based cancer registries. Registry data were linked with LexisNexis consumer data to identify patients with a history of court-documented AFEs before cancer diagnosis. The association of AFEs and later-stage cancer diagnoses (stages III/IV) was assessed using separate sex-specific multivariable logistic regression. RESULTS: Among 101,649 patients with cancer linked to LexisNexis data, 36,791 (36.2%) had a major AFE reported before diagnosis. The mean and median timing of the AFE closest to diagnosis were 93 and 77 months, respectively. AFEs were most common among non-Hispanic Black, unmarried, and low-income patients. Individuals with previous AFEs were more likely to be diagnosed with later-stage cancer than individuals with no AFE (males-odds ratio [OR], 1.09 [95% CI, 1.03 to 1.14]; P < .001; females-OR, 1.18 [95% CI, 1.13 to 1.24]; P < .0001) after adjusting for age, race, marital status, income, registry, and cancer type. Associations between AFEs prediagnosis and later-stage disease did not vary by AFE timing. CONCLUSION: One third of newly diagnosed patients with cancer had a major AFE before their diagnosis. Patients with AFEs were more likely to have later-stage diagnosis, even accounting for traditional measures of socioeconomic status that influence the stage at diagnosis. The prevalence of prediagnosis AFEs underscores financial vulnerability of patients with cancer before their diagnosis, before any subsequent financial burden associated with cancer treatment.


Asunto(s)
Población Negra , Neoplasias , Femenino , Masculino , Estados Unidos/epidemiología , Humanos , Adulto Joven , Adulto , Persona de Mediana Edad , Anciano , Georgia/epidemiología , Sistema de Registros , Neoplasias/diagnóstico , Neoplasias/epidemiología
2.
J Biomed Inform ; 149: 104576, 2024 01.
Artículo en Inglés | MEDLINE | ID: mdl-38101690

RESUMEN

INTRODUCTION: Machine learning algorithms are expected to work side-by-side with humans in decision-making pipelines. Thus, the ability of classifiers to make reliable decisions is of paramount importance. Deep neural networks (DNNs) represent the state-of-the-art models to address real-world classification. Although the strength of activation in DNNs is often correlated with the network's confidence, in-depth analyses are needed to establish whether they are well calibrated. METHOD: In this paper, we demonstrate the use of DNN-based classification tools to benefit cancer registries by automating information extraction of disease at diagnosis and at surgery from electronic text pathology reports from the US National Cancer Institute (NCI) Surveillance, Epidemiology, and End Results (SEER) population-based cancer registries. In particular, we introduce multiple methods for selective classification to achieve a target level of accuracy on multiple classification tasks while minimizing the rejection amount-that is, the number of electronic pathology reports for which the model's predictions are unreliable. We evaluate the proposed methods by comparing our approach with the current in-house deep learning-based abstaining classifier. RESULTS: Overall, all the proposed selective classification methods effectively allow for achieving the targeted level of accuracy or higher in a trade-off analysis aimed to minimize the rejection rate. On in-distribution validation and holdout test data, with all the proposed methods, we achieve on all tasks the required target level of accuracy with a lower rejection rate than the deep abstaining classifier (DAC). Interpreting the results for the out-of-distribution test data is more complex; nevertheless, in this case as well, the rejection rate from the best among the proposed methods achieving 97% accuracy or higher is lower than the rejection rate based on the DAC. CONCLUSIONS: We show that although both approaches can flag those samples that should be manually reviewed and labeled by human annotators, the newly proposed methods retain a larger fraction and do so without retraining-thus offering a reduced computational cost compared with the in-house deep learning-based abstaining classifier.


Asunto(s)
Aprendizaje Profundo , Humanos , Incertidumbre , Redes Neurales de la Computación , Algoritmos , Aprendizaje Automático
3.
Cancer Epidemiol Biomarkers Prev ; 32(11): 1591-1598, 2023 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-37594474

RESUMEN

INTRODUCTION: Health care procedures including cancer screening and diagnosis were interrupted due to the COVID-19 pandemic. The extent of this impact on cancer care in the United States is not fully understood. We investigated pathology report volume as a reflection of trends in oncology services pre-pandemic and during the pandemic. METHODS: Electronic pathology reports were obtained from 11 U.S. central cancer registries from NCI's SEER Program. The reports were sorted by cancer site and document type using a validated algorithm. Joinpoint regression was used to model temporal trends from January 2018 to February 2020, project expected counts from March 2020 to February 2021 and calculate observed-to-expected ratios. Results were stratified by sex, age, cancer site, and report type. RESULTS: During the first 3 months of the pandemic, pathology report volume decreased by 25.5% and 17.4% for biopsy and surgery reports, respectively. The 12-month O/E ratio (March 2020-February 2021) was lowest for women (O/E 0.90) and patients 65 years and older (O/E 0.91) and lower for cancers with screening (melanoma skin, O/E 0.86; breast, O/E 0.88; lung O/E 0.89, prostate, O/E 0.90; colorectal, O/E 0.91) when compared with all other cancers combined. CONCLUSIONS: These findings indicate a decrease in cancer diagnosis, likely due to the COVID-19 pandemic. This decrease in the number of pathology reports may result in a stage shift causing a subsequent longer-term impact on survival patterns. IMPACT: Investigation on the longer-term impact of the pandemic on pathology services is vital to understand if cancer care delivery levels continue to be affected.


Asunto(s)
COVID-19 , Melanoma , Masculino , Humanos , Femenino , Estados Unidos/epidemiología , Programa de VERF , Pandemias , Incidencia , COVID-19/epidemiología , Sistema de Registros
4.
J Clin Oncol ; 41(24): 4045-4053, 2023 08 20.
Artículo en Inglés | MEDLINE | ID: mdl-37267580

RESUMEN

Data-driven basic, translational, and clinical research has resulted in improved outcomes for children, adolescents, and young adults (AYAs) with pediatric cancers. However, challenges in sharing data between institutions, particularly in research, prevent addressing substantial unmet needs in children and AYA patients diagnosed with certain pediatric cancers. Systematically collecting and sharing data from every child and AYA can enable greater understanding of pediatric cancers, improve survivorship, and accelerate development of new and more effective therapies. To accomplish this goal, the Childhood Cancer Data Initiative (CCDI) was launched in 2019 at the National Cancer Institute. CCDI is a collaborative community endeavor supported by a 10-year, $50-million (in US dollars) annual federal investment. CCDI aims to learn from every patient diagnosed with a pediatric cancer by designing and building a data ecosystem that facilitates data collection, sharing, and analysis for researchers, clinicians, and patients across the cancer community. For example, CCDI's Molecular Characterization Initiative provides comprehensive clinical molecular characterization for children and AYAs with newly diagnosed cancers. Through these efforts, the CCDI strives to provide clinical benefit to patients and improvements in diagnosis and care through data-focused research support and to build expandable, sustainable data resources and workflows to advance research well past the planned 10 years of the initiative. Importantly, if CCDI demonstrates the success of this model for pediatric cancers, similar approaches can be applied to adults, transforming both clinical research and treatment to improve outcomes for all patients with cancer.


Asunto(s)
Neoplasias , Adolescente , Estados Unidos/epidemiología , Humanos , Niño , Adulto Joven , Neoplasias/terapia , Ecosistema , Recolección de Datos , National Cancer Institute (U.S.)
5.
PLoS One ; 18(3): e0280584, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-36943829

RESUMEN

This retrospective observational study aimed to gain a better understanding of the protective duration of prior SARS-CoV-2 infection against reinfection. The objectives were two-fold: to assess the durability of immunity to SARS-CoV-2 reinfection among initially unvaccinated individuals with previous SARS-CoV-2 infection, and to evaluate the crude SARS-CoV-2 reinfection rate and associated risk factors. During the pandemic era time period from February 29, 2020, through April 30, 2021, 144,678,382 individuals with SARS-CoV-2 molecular diagnostic or antibody test results were studied. Rates of reinfection among index-positive individuals were compared to rates of infection among index-negative individuals. Factors associated with reinfection were evaluated using multivariable logistic regression. For both objectives, the outcome was a subsequent positive molecular diagnostic test result. Consistent with prior findings, the risk of reinfection among index-positive individuals was 87% lower than the risk of infection among index-negative individuals. The duration of protection against reinfection was stable over the median 5 months and up to 1-year follow-up interval. Factors associated with an increased reinfection risk included older age, comorbid immunologic conditions, and living in congregate care settings; healthcare workers had a decreased reinfection risk. This large US population-based study suggests that infection induced immunity is durable for variants circulating pre-Delta predominance.


Asunto(s)
COVID-19 , SARS-CoV-2 , Humanos , Reinfección/epidemiología , COVID-19/epidemiología , Anticuerpos , Personal de Salud
6.
JAMIA Open ; 5(3): ooac075, 2022 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-36110150

RESUMEN

Objective: We aim to reduce overfitting and model overconfidence by distilling the knowledge of an ensemble of deep learning models into a single model for the classification of cancer pathology reports. Materials and Methods: We consider the text classification problem that involves 5 individual tasks. The baseline model consists of a multitask convolutional neural network (MtCNN), and the implemented ensemble (teacher) consists of 1000 MtCNNs. We performed knowledge transfer by training a single model (student) with soft labels derived through the aggregation of ensemble predictions. We evaluate performance based on accuracy and abstention rates by using softmax thresholding. Results: The student model outperforms the baseline MtCNN in terms of abstention rates and accuracy, thereby allowing the model to be used with a larger volume of documents when deployed. The highest boost was observed for subsite and histology, for which the student model classified an additional 1.81% reports for subsite and 3.33% reports for histology. Discussion: Ensemble predictions provide a useful strategy for quantifying the uncertainty inherent in labeled data and thereby enable the construction of soft labels with estimated probabilities for multiple classes for a given document. Training models with the derived soft labels reduce model confidence in difficult-to-classify documents, thereby leading to a reduction in the number of highly confident wrong predictions. Conclusions: Ensemble model distillation is a simple tool to reduce model overconfidence in problems with extreme class imbalance and noisy datasets. These methods can facilitate the deployment of deep learning models in high-risk domains with low computational resources where minimizing inference time is required.

7.
Am J Epidemiol ; 191(12): 2075-2083, 2022 11 19.
Artículo en Inglés | MEDLINE | ID: mdl-35872590

RESUMEN

Follow-up of US cohort members for incident cancer is time-consuming, is costly, and often results in underascertainment when the traditional methods of self-reporting and/or medical record validation are used. We conducted one of the first large-scale investigations to assess the feasibility, methods, and benefits of linking participants in the US Radiologic Technologists (USRT) Study (n = 146,022) with the majority of US state or regional cancer registries. Follow-up of this cohort has relied primarily on questionnaires (mailed approximately every 10 years) and linkage with the National Death Index. We compared the level of agreement and completeness of questionnaire/death-certificate-based information with that of registry-based (43 registries) incident cancer follow-up in the USRT cohort. Using registry-identified first primary cancers from 1999-2012 as the gold standard, the overall sensitivity was 46.5% for self-reports only and 63.0% for both self-reports and death certificates. Among the 37.0% false-negative reports, 27.8% were due to dropout, while 9.2% were due to misreporting. The USRT cancer reporting patterns differed by cancer type. Our study indicates that linkage to state cancer registries would greatly improve completeness and accuracy of cancer follow-up in comparison with questionnaire self-reporting. These findings support ongoing development of a national US virtual pooled registry with which to streamline cohort linkages.


Asunto(s)
Certificado de Defunción , Neoplasias , Humanos , Estudios de Cohortes , Autoinforme , Incidencia , Neoplasias/epidemiología , Sistema de Registros
8.
JAMIA Open ; 5(2): ooac049, 2022 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-35721398

RESUMEN

Objectives: The International Classification of Childhood Cancer (ICCC) facilitates the effective classification of a heterogeneous group of cancers in the important pediatric population. However, there has been no development of machine learning models for the ICCC classification. We developed deep learning-based information extraction models from cancer pathology reports based on the ICD-O-3 coding standard. In this article, we describe extending the models to perform ICCC classification. Materials and Methods: We developed 2 models, ICD-O-3 classification and ICCC recoding (Model 1) and direct ICCC classification (Model 2), and 4 scenarios subject to the training sample size. We evaluated these models with a corpus consisting of 29 206 reports with age at diagnosis between 0 and 19 from 6 state cancer registries. Results: Our findings suggest that the direct ICCC classification (Model 2) is substantially better than reusing the ICD-O-3 classification model (Model 1). Applying the uncertainty quantification mechanism to assess the confidence of the algorithm in assigning a code demonstrated that the model achieved a micro-F1 score of 0.987 while abstaining (not sufficiently confident to assign a code) on only 14.8% of ambiguous pathology reports. Conclusions: Our experimental results suggest that the machine learning-based automatic information extraction from childhood cancer pathology reports in the ICCC is a reliable means of supplementing human annotators at state cancer registries by reading and abstracting the majority of the childhood cancer pathology reports accurately and reliably.

9.
medRxiv ; 2022 Feb 26.
Artículo en Inglés | MEDLINE | ID: mdl-35233580

RESUMEN

IMPORTANCE: Better understanding of the protective duration of prior SARS-CoV-2 infection against reinfection is needed. OBJECTIVE: Primary: To assess the durability of immunity to SARS-CoV-2 reinfection among initially unvaccinated individuals with previous SARS-CoV-2 infection. Secondary: Evaluate the crude SARS-CoV-2 reinfection rate and associated characteristics. DESIGN AND SETTING: Retrospective observational study of HealthVerity data among 144,678,382 individuals, during the pandemic era through April 2021. PARTICIPANTS: Individuals studied had SARS-CoV-2 molecular diagnostic or antibody index test results from February 29 through December 9, 2020, with ≥365 days of pre-index continuous closed medical enrollment, claims, or electronic health record activity. MAIN OUTCOMES AND MEASURES: Rates of reinfection among index-positive individuals were compared to rates of infection among index-negative individuals. Factors associated with reinfection were evaluated using multivariable logistic regression. For both objectives, the outcome was a subsequent positive molecular diagnostic test result. RESULTS: Among 22,786,982 individuals with index SARS-CoV-2 laboratory test data (2,023,341 index positive), the crude rate of reinfection during follow-up was significantly lower (9.89/1,000-person years) than that of primary infection (78.39/1,000 person years). Consistent with prior findings, the risk of reinfection among index-positive individuals was 87% lower than the risk of infection among index-negative individuals (hazard ratio, 0.13; 95% CI, 0.13, 0.13). The cumulative incidence of reinfection among index-positive individuals and infection among index-negative individuals was 0.85% (95% CI: 0.82%, 0.88%) and 6.2% (95% CI: 6.1%, 6.3%), respectively, over follow-up of 375 days. The duration of protection against reinfection was stable over the median 5 months and up to 1-year follow-up interval. Factors associated with an increased reinfection risk included older age, comorbid immunologic conditions, and living in congregate care settings; healthcare workers had a decreased reinfection risk. CONCLUSIONS AND RELEVANCE: This large US population-based study demonstrates that SARS-CoV-2 reinfection is uncommon among individuals with laboratory evidence of a previous infection. Protection from SARS-CoV-2 reinfection is stable up to one year. Reinfection risk was primarily associated with age 85+ years, comorbid immunologic conditions and living in congregate care settings; healthcare workers demonstrated a decreased reinfection risk. These findings suggest that infection induced immunity is durable for variants circulating prior to Delta. KEY POINTS: Question: How long does prior SARS-CoV-2 infection provide protection against SARS-CoV-2 reinfection?Finding: Among >22 million individuals tested February 2020 through April 2021, the relative risk of reinfection among those with prior infection was 87% lower than the risk of infection among individuals without prior infection. This protection was durable for up to a year. Factors associated with increased likelihood of reinfection included older age (85+ years), comorbid immunologic conditions, and living in congregate care settings; healthcare workers had lower risk.Meaning: Prior SARS-CoV-2 infection provides a durable, high relative degree of protection against reinfection.

10.
IEEE J Biomed Health Inform ; 26(6): 2796-2803, 2022 06.
Artículo en Inglés | MEDLINE | ID: mdl-35020599

RESUMEN

Recent applications ofdeep learning have shown promising results for classifying unstructured text in the healthcare domain. However, the reliability of models in production settings has been hindered by imbalanced data sets in which a small subset of the classes dominate. In the absence of adequate training data, rare classes necessitate additional model constraints for robust performance. Here, we present a strategy for incorporating short sequences of text (i.e. keywords) into training to boost model accuracy on rare classes. In our approach, we assemble a set of keywords, including short phrases, associated with each class. The keywords are then used as additional data during each batch of model training, resulting in a training loss that has contributions from both raw data and keywords. We evaluate our approach on classification of cancer pathology reports, which shows a substantial increase in model performance for rare classes. Furthermore, we analyze the impact of keywords on model output probabilities for bigrams, providing a straightforward method to identify model difficulties for limited training data.


Asunto(s)
Reproducibilidad de los Resultados , Recolección de Datos , Humanos
11.
J Biomed Inform ; 125: 103957, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34823030

RESUMEN

In the last decade, the widespread adoption of electronic health record documentation has created huge opportunities for information mining. Natural language processing (NLP) techniques using machine and deep learning are becoming increasingly widespread for information extraction tasks from unstructured clinical notes. Disparities in performance when deploying machine learning models in the real world have recently received considerable attention. In the clinical NLP domain, the robustness of convolutional neural networks (CNNs) for classifying cancer pathology reports under natural distribution shifts remains understudied. In this research, we aim to quantify and improve the performance of the CNN for text classification on out-of-distribution (OOD) datasets resulting from the natural evolution of clinical text in pathology reports. We identified class imbalance due to different prevalence of cancer types as one of the sources of performance drop and analyzed the impact of previous methods for addressing class imbalance when deploying models in real-world domains. Our results show that our novel class-specialized ensemble technique outperforms other methods for the classification of rare cancer types in terms of macro F1 scores. We also found that traditional ensemble methods perform better in top classes, leading to higher micro F1 scores. Based on our findings, we formulate a series of recommendations for other ML practitioners on how to build robust models with extremely imbalanced datasets in biomedical NLP applications.


Asunto(s)
Procesamiento de Lenguaje Natural , Neoplasias , Registros Electrónicos de Salud , Humanos , Aprendizaje Automático , Redes Neurales de la Computación
12.
CA Cancer J Clin ; 72(3): 287-300, 2022 05.
Artículo en Inglés | MEDLINE | ID: mdl-34964981

RESUMEN

Generating evidence on the use, effectiveness, and safety of new cancer therapies is a priority for researchers, health care providers, payers, and regulators given the rapid pace of change in cancer diagnosis and treatments. The use of real-world data (RWD) is integral to understanding the utilization patterns and outcomes of these new treatments among patients with cancer who are treated in clinical practice and community settings. An initial step in the use of RWD is careful study design to assess the suitability of an RWD source. This pivotal process can be guided by using a conceptual model that encourages predesign conceptualization. The primary types of RWD included are electronic health records, administrative claims data, cancer registries, and specialty data providers and networks. Careful consideration of each data type is necessary because they are collected for a specific purpose, capturing a set of data elements within a certain population for that purpose, and they vary by population coverage and longitudinality. In this review, the authors provide a high-level assessment of the strengths and limitations of each data category to inform data source selection appropriate to the study question. Overall, the development and accessibility of RWD sources for cancer research are rapidly increasing, and the use of these data requires careful consideration of composition and utility to assess important questions in understanding the use and effectiveness of new therapies.


Asunto(s)
Almacenamiento y Recuperación de la Información , Oncología Médica , Registros Electrónicos de Salud , Humanos , Sistema de Registros , Proyectos de Investigación
13.
J Registry Manag ; 49(4): 109-113, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-37260810

RESUMEN

The National Cancer Institute (NCI) Surveillance, Epidemiology, and End Results (SEER) program is continuously exploring opportunities to augment its already extensive collection of data, enhance the quality of reported cancer information, and contribute to more comprehensive analyses of cancer burden. This manuscript describes a recent linkage of the LexisNexis longitudinal residential history data with 11 SEER registries and provides estimates of the inter-state mobility of SEER cancer patients. To identify mobility from one state to another, we used state postal abbreviations to generate state-level residential histories. From this, we determined how often cancer patients moved from state-to-state. The results in this paper provide information on the linkage with LexisNexis data and useful information on state-to-state residential mobility patterns of a large portion of US cancer patients for the most recent 1-, 2-, 3-, 4-, and 5-year periods. We show that mobility patterns vary by geographic area, race/ethnicity and age, and cancer patients tend to move less than the general population.


Asunto(s)
Neoplasias , Humanos , Estados Unidos/epidemiología , Neoplasias/epidemiología , Sistema de Registros , Dinámica Poblacional , Etnicidad , Programa de VERF
14.
JCO Clin Cancer Inform ; 5: 881-896, 2021 08.
Artículo en Inglés | MEDLINE | ID: mdl-34428097

RESUMEN

Cancer Informatics for Cancer Centers (CI4CC) is a grassroots, nonprofit 501c3 organization intended to provide a focused national forum for engagement of senior cancer informatics leaders, primarily aimed at academic cancer centers anywhere in the world but with a special emphasis on the 70 National Cancer Institute-funded cancer centers. This consortium has regularly held topic-focused biannual face-to-face symposiums. These meetings are a place to review cancer informatics and data science priorities and initiatives, providing a forum for discussion of the strategic and pragmatic issues that we faced at our respective institutions and cancer centers. Here, we provide meeting highlights from the latest CI4CC Symposium, which was delayed from its original April 2020 schedule because of the COVID-19 pandemic and held virtually over three days (September 24, October 1, and October 8) in the fall of 2020. In addition to the content presented, we found that holding this event virtually once a week for 6 hours was a great way to keep the kind of deep engagement that a face-to-face meeting engenders. This is the second such publication of CI4CC Symposium highlights, the first covering the meeting that took place in Napa, California, from October 14-16, 2019. We conclude with some thoughts about using data science to learn from every child with cancer, focusing on emerging activities of the National Cancer Institute's Childhood Cancer Data Initiative.


Asunto(s)
COVID-19 , Informática Médica , Neoplasias , Adolescente , Niño , Ciencia de los Datos , Humanos , Neoplasias/epidemiología , Neoplasias/terapia , Pandemias , SARS-CoV-2 , Adulto Joven
15.
BMC Bioinformatics ; 22(1): 113, 2021 Mar 09.
Artículo en Inglés | MEDLINE | ID: mdl-33750288

RESUMEN

BACKGROUND: Automated text classification has many important applications in the clinical setting; however, obtaining labelled data for training machine learning and deep learning models is often difficult and expensive. Active learning techniques may mitigate this challenge by reducing the amount of labelled data required to effectively train a model. In this study, we analyze the effectiveness of 11 active learning algorithms on classifying subsite and histology from cancer pathology reports using a Convolutional Neural Network as the text classification model. RESULTS: We compare the performance of each active learning strategy using two differently sized datasets and two different classification tasks. Our results show that on all tasks and dataset sizes, all active learning strategies except diversity-sampling strategies outperformed random sampling, i.e., no active learning. On our large dataset (15K initial labelled samples, adding 15K additional labelled samples each iteration of active learning), there was no clear winner between the different active learning strategies. On our small dataset (1K initial labelled samples, adding 1K additional labelled samples each iteration of active learning), marginal and ratio uncertainty sampling performed better than all other active learning techniques. We found that compared to random sampling, active learning strongly helps performance on rare classes by focusing on underrepresented classes. CONCLUSIONS: Active learning can save annotation cost by helping human annotators efficiently and intelligently select which samples to label. Our results show that a dataset constructed using effective active learning techniques requires less than half the amount of labelled data to achieve the same performance as a dataset constructed using random sampling.


Asunto(s)
Aprendizaje Automático , Neoplasias , Algoritmos , Humanos , Neoplasias/genética , Neoplasias/patología , Redes Neurales de la Computación
16.
JAMA Intern Med ; 181(5): 672-679, 2021 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-33625463

RESUMEN

Importance: Understanding the effect of serum antibodies to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) on susceptibility to infection is important for identifying at-risk populations and could have implications for vaccine deployment. Objective: The study purpose was to evaluate evidence of SARS-CoV-2 infection based on diagnostic nucleic acid amplification test (NAAT) among patients with positive vs negative test results for antibodies in an observational descriptive cohort study of clinical laboratory and linked claims data. Design, Setting, and Participants: The study created cohorts from a deidentified data set composed of commercial laboratory tests, medical and pharmacy claims, electronic health records, and hospital chargemaster data. Patients were categorized as antibody-positive or antibody-negative according to their first SARS-CoV-2 antibody test in the database. Main Outcomes and Measures: Primary end points were post-index diagnostic NAAT results, with infection defined as a positive diagnostic test post-index, measured in 30-day intervals (0-30, 31-60, 61-90, >90 days). Additional measures included demographic, geographic, and clinical characteristics at the time of the index antibody test, including recorded signs and symptoms or prior evidence of coronavirus 2019 (COVID) diagnoses or positive NAAT results and recorded comorbidities. Results: The cohort included 3 257 478 unique patients with an index antibody test; 56% were female with a median (SD) age of 48 (20) years. Of these, 2 876 773 (88.3%) had a negative index antibody result, and 378 606 (11.6%) had a positive index antibody result. Patients with a negative antibody test result were older than those with a positive result (mean age 48 vs 44 years). Of index-positive patients, 18.4% converted to seronegative over the follow-up period. During the follow-up periods, the ratio (95% CI) of positive NAAT results among individuals who had a positive antibody test at index vs those with a negative antibody test at index was 2.85 (95% CI, 2.73-2.97) at 0 to 30 days, 0.67 (95% CI, 0.6-0.74) at 31 to 60 days, 0.29 (95% CI, 0.24-0.35) at 61 to 90 days, and 0.10 (95% CI, 0.05-0.19) at more than 90 days. Conclusions and Relevance: In this cohort study, patients with positive antibody test results were initially more likely to have positive NAAT results, consistent with prolonged RNA shedding, but became markedly less likely to have positive NAAT results over time, suggesting that seropositivity is associated with protection from infection. The duration of protection is unknown, and protection may wane over time.


Asunto(s)
Prueba de Ácido Nucleico para COVID-19 , Prueba Serológica para COVID-19 , COVID-19 , Susceptibilidad a Enfermedades , SARS-CoV-2 , Adulto , Factores de Edad , Anticuerpos Antivirales/aislamiento & purificación , COVID-19/sangre , COVID-19/diagnóstico , COVID-19/epidemiología , COVID-19/prevención & control , Prueba de Ácido Nucleico para COVID-19/métodos , Prueba de Ácido Nucleico para COVID-19/estadística & datos numéricos , Prueba Serológica para COVID-19/métodos , Prueba Serológica para COVID-19/estadística & datos numéricos , Correlación de Datos , Susceptibilidad a Enfermedades/diagnóstico , Susceptibilidad a Enfermedades/epidemiología , Susceptibilidad a Enfermedades/inmunología , Femenino , Humanos , Masculino , Persona de Mediana Edad , SARS-CoV-2/inmunología , SARS-CoV-2/aislamiento & purificación , Estudios Seroepidemiológicos , Evaluación de Síntomas/métodos , Evaluación de Síntomas/estadística & datos numéricos , Estados Unidos/epidemiología , Esparcimiento de Virus/inmunología
17.
Arch Pathol Lab Med ; 145(2): 222-226, 2021 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-33501497

RESUMEN

CONTEXT.­: The Surveillance, Epidemiology, and End Results (SEER) cancer registry program is currently evaluating the use of archival, diagnostic, formalin-fixed, paraffin-embedded (FFPE) tissue obtained through SEER cancer registries, functioning as honest brokers for deidentified tissue and associated data. To determine the feasibility of this potential program, laboratory policies for sharing tissue for research needed to be assessed. OBJECTIVE.­: To understand the willingness of pathology laboratories to share archival diagnostic tissue for cancer research and related policies. DESIGN.­: Seven SEER registries administered a 27-item questionnaire to pathology laboratories within their respective registry catchment areas. Only laboratories that processed diagnostic FFPE specimens and completed the questionnaire were included in the analysis. RESULTS.­: Of the 153 responding laboratories, 127 (83%) responded that they process FFPE specimens. Most (n = 88; 69%) were willing to share tissue specimens for research, which was not associated with the number of blocks processed per year by the laboratories. Most laboratories retained the specimens for at least 10 years. Institutional regulatory policies on sharing deidentified tissue varied considerably, ranging from requiring a full Institutional Review Board review to considering such use exempt from Institutional Review Board review, and 43% (55 of 127) of the laboratories did not know their terms for sharing tissue for research. CONCLUSIONS.­: This project indicated a general willingness of pathology laboratories to participate in research by sharing FFPE tissue. Given the variability of research policies across laboratories, it is critical for each SEER registry to work with laboratories in their catchment area to understand such policies and state legislation regulating tissue retention and guardianship.


Asunto(s)
Laboratorios/legislación & jurisprudencia , Neoplasias/patología , Políticas , Investigación/legislación & jurisprudencia , Programa de VERF/legislación & jurisprudencia , Formaldehído , Humanos , Neoplasias/diagnóstico , Adhesión en Parafina , Patología , Fijación del Tejido
18.
IEEE Trans Emerg Top Comput ; 9(3): 1219-1230, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-36117774

RESUMEN

Population cancer registries can benefit from Deep Learning (DL) to automatically extract cancer characteristics from the high volume of unstructured pathology text reports they process annually. The success of DL to tackle this and other real-world problems is proportional to the availability of large labeled datasets for model training. Although collaboration among cancer registries is essential to fully exploit the promise of DL, privacy and confidentiality concerns are main obstacles for data sharing across cancer registries. Moreover, DL for natural language processing (NLP) requires sharing a vocabulary dictionary for the embedding layer which may contain patient identifiers. Thus, even distributing the trained models across cancer registries causes a privacy violation issue. In this paper, we propose DL NLP model distribution via privacy-preserving transfer learning approaches without sharing sensitive data. These approaches are used to distribute a multitask convolutional neural network (MT-CNN) NLP model among cancer registries. The model is trained to extract six key cancer characteristics - tumor site, subsite, laterality, behavior, histology, and grade - from cancer pathology reports. Using 410,064 pathology documents from two cancer registries, we compare our proposed approach to conventional transfer learning without privacy-preserving, single-registry models, and a model trained on centrally hosted data. The results show that transfer learning approaches including data sharing and model distribution outperform significantly the single-registry model. In addition, the best performing privacy-preserving model distribution approach achieves statistically indistinguishable average micro- and macro-F1 scores across all extraction tasks (0.823,0.580) as compared to the centralized model (0.827,0.585).

19.
medRxiv ; 2020 Dec 20.
Artículo en Inglés | MEDLINE | ID: mdl-33354682

RESUMEN

Importance There is limited evidence regarding whether the presence of serum antibodies to SARS-CoV-2 is associated with a decreased risk of future infection. Understanding susceptibility to infection and the role of immune memory is important for identifying at-risk populations and could have implications for vaccine deployment. Objective The purpose of this study was to evaluate subsequent evidence of SARS-CoV-2 infection based on diagnostic nucleic acid amplification test (NAAT) among individuals who are antibody-positive compared with those who are antibody-negative, using real-world data. Design This was an observational descriptive cohort study. Participants The study utilized a national sample to create cohorts from a de-identified dataset composed of commercial laboratory test results, open and closed medical and pharmacy claims, electronic health records, hospital billing (chargemaster) data, and payer enrollment files from the United States. Patients were indexed as antibody-positive or antibody-negative according to their first SARS-CoV-2 antibody test recorded in the database. Patients with more than 1 antibody test on the index date where results were discordant were excluded. Main Outcomes/Measures Primary endpoints were index antibody test results and post-index diagnostic NAAT results, with infection defined as a positive diagnostic test post-index, as measured in 30-day intervals (0-30, 31-60, 61-90, >90 days). Additional measures included demographic, geographic, and clinical characteristics at the time of the index antibody test, such as recorded signs and symptoms or prior evidence of COVID-19 (diagnoses or NAAT+) and recorded comorbidities. Results We included 3,257,478 unique patients with an index antibody test. Of these, 2,876,773 (88.3%) had a negative index antibody result, 378,606 (11.6%) had a positive index antibody result, and 2,099 (0.1%) had an inconclusive index antibody result. Patients with a negative antibody test were somewhat older at index than those with a positive result (mean of 48 versus 44 years). A fraction (18.4%) of individuals who were initially seropositive converted to seronegative over the follow up period. During the follow-up periods, the ratio (CI) of positive NAAT results among individuals who had a positive antibody test at index versus those with a negative antibody test at index was 2.85 (2.73 - 2.97) at 0-30 days, 0.67 (0.6 - 0.74) at 31-60 days, 0.29 (0.24 - 0.35) at 61-90 days), and 0.10 (0.05 - 0.19) at >90 days. Conclusions Patients who display positive antibody tests are initially more likely to have a positive NAAT, consistent with prolonged RNA shedding, but over time become markedly less likely to have a positive NAAT. This result suggests seropositivity using commercially available assays is associated with protection from infection. The duration of protection is unknown and may wane over time; this parameter will need to be addressed in a study with extended duration of follow up.

20.
Lancet Oncol ; 21(9): e444-e451, 2020 09.
Artículo en Inglés | MEDLINE | ID: mdl-32888473

RESUMEN

Population-based cancer registries (PBCRs) generate measures of cancer incidence and survival that are essential for cancer surveillance, research, and cancer control strategies. In 2014, the Toronto Paediatric Cancer Stage Guidelines were developed to standardise how PBCRs collect data on the stage at diagnosis for childhood cancer cases. These guidelines have been implemented in multiple jurisdictions worldwide to facilitate international comparative studies of incidence and outcome. Robust stratification by risk also requires data on key non-stage prognosticators (NSPs). Key experts and stakeholders used a modified Delphi approach to establish principles guiding paediatric cancer NSP data collection. With the use of these principles, recommendations were made on which NSPs should be collected for the major malignancies in children. The 2014 Toronto Stage Guidelines were also reviewed and updated where necessary. Wide adoption of the resultant Paediatric NSP Guidelines and updated Toronto Stage Guidelines will enhance the harmonisation and use of childhood cancer data provided by PBCRs.


Asunto(s)
Guías como Asunto/normas , Neoplasias/terapia , Pediatría/tendencias , Pronóstico , Niño , Atención a la Salud , Humanos , Estadificación de Neoplasias , Neoplasias/epidemiología , Sistema de Registros
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...